Identifying the influence of transfer learning method in developing an end-to-end automatic speech recognition system with a low data level

نویسندگان

چکیده

Ensuring the best quality and performance of modern speech technologies, today, is possible based on widespread use machine learning methods. The idea this project to study implement an end-to-end system automatic recognition using methods, as well develop new mathematical models algorithms for solving problem agglutinative (Turkic) languages. Many research papers have shown that deep methods make it easier train systems approach. This method can also directly, is, without manual work with raw signals. Despite good quality, model has some drawbacks. These disadvantages are need a large amount data training. serious low-data languages, especially Turkic languages such Kazakh Azerbaijani. To solve problem, various needed apply. Some used belonging group same family (agglutinative languages). Method low-resource transfer learning, resources – multi-task learning. increase efficiency quickly associated limited resource, was model. helped fit trained dataset Azerbaijani dataset. Thereby, two language corpora were simultaneously. Conducted experiments show reduce symbol error rate, phoneme rate (PER), by 14.23 % compared baseline (DNN+HMM, WaveNet, CNC+LM). Therefore, realized be recognize other

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the aesthetic dimension of howard barkers art: a frankfurtian approach to scenes from an execution and no end of blame

رابطه ی میانِ هنر و شرایطِ اجتماعیِ زایش آن همواره در طولِ تاریخ دغدغه ی ذهنی و دل مشغولیِ اساسیِ منتقدان و نیز هنرمندان بوده است. از آنجا که هنر در قفس آهنیِ زندگیِ اجتماعی محبوس است، گسترش وابستگیِ آن با نهاد ها و اصولِ اجتماعی پیرامون، صرفِ نظر از هم سو بودن و یا غیرِ هم سو بودنِ آن نهاد ها، امری اجتناب ناپذیر به نظر می رسد. با این وجود پدیدار گشتنِ چنین مباحثِ حائز اهمییتی در میان منتقدین، با ظهورِ مکتب ما...

Wav2Letter: an End-to-End ConvNet-based Speech Recognition System

This paper presents a simple end-to-end model for speech recognition, combining a convolutional network based acoustic model and a graph decoding. It is trained to output letters, with transcribed speech, without the need for force alignment of phonemes. We introduce an automatic segmentation criterion for training from sequence annotation without alignment that is on par with CTC [6] while bei...

متن کامل

Improving End-to-End Speech Recognition with Policy Learning

Connectionist temporal classification (CTC) is widely used for maximum likelihood learning in end-to-end speech recognition models. However, there is usually a disparity between the negative maximum likelihood and the performance metric used in speech recognition, e.g., word error rate (WER). This results in a mismatch between the objective function and metric during training. We show that the ...

متن کامل

End-to-End Deep Neural Network for Automatic Speech Recognition

We investigate the efficacy of deep neural networks on speech recognition. Specifically, we implement an end-to-end deep learning system that utilizes mel-filter bank features to directly output to spoken phonemes without the need of a traditional Hidden Markov Model for decoding. The system will comprise of two variants of neural networks for phoneme recognition. In particular, we utilize conv...

متن کامل

End-to-end Video-level Representation Learning for Action Recognition

From the frame/clip-level feature learning to the videolevel representation building, deep learning methods in action recognition have developed rapidly in recent years. However, current methods suffer from the confusion caused by partial observation training, or without end-to-end learning, or restricted to single temporal scale modeling and so on. In this paper, we build upon two-stream ConvN...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Eastern-European Journal of Enterprise Technologies

سال: 2022

ISSN: ['1729-3774', '1729-4061']

DOI: https://doi.org/10.15587/1729-4061.2022.252801